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Abstract: The present invention relates to nucleotide sequences of the NITl gene and amino acid 
sequences of its encoded proteins, as well as derivatives and analogs thereof. Additionally, the present 
invention relates to the use (^) of nucleotide sequences of NIT] genes and amino acid sequences of their 
encoded proteins, as well as derivatives and analogs thereof and antibodies thereto, as diagnostic and 
therapeutic reagents for the detection and treatment of cancer. The present invention also relates to 
therapeutic compositions comprising Nit 1 proteins, derivatives or analogs thereof, antibodies thereto, 
nucleic acids encoding the Nitl proteins, derivatives, or analogs and NITl an'tisense nucleic acids, and 
vectors containing the NITl coding sequence. 
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Detailed Description _^ 

NITRILASE HOMOLOGS FIELD OF THE INVENTION The present invention generally relates to the 
field of oncology and tumor suppressor genes, and more particularly to the structure and function of the 
NIT] gene, the structure of its encoded proteins, and the use of NIT] genes and the NIT] related genes 
and their encoded proteins and vectors containing the NIT] coding sequence as diagnostic and 
therapeutic reagents for the detection and treatment of cancer. 

15 BACKGROUND OF THE INVENTION Introduction The present invention relates to nucleotide 
sequences of the NIT] gene and amino acid sequences of its encoded proteins, as well as derivatives and 
analogs thereof Additionally, the present invention relates to the use of nucleotide sequences of NIT] 
genes and amino acid sequences of their encoded proteins and vectors containing the NIT] coding 
sequence, as well as derivatives and analogs thereof and antibodies thereto, as diagnostic and therapeutic 
reagents for the detection and treatment of cancer. The present invention also relates to therapeutic 
compositions comprising Nitl proteins, derivatives or analogs thereof, antibodies thereto, nucleic acids 
encoding the Nitl proteins, derivatives, or analogs, and NIT] antisense nucleic acids, and vectors 
containing the NIT] coding sequence. 

A-pproaches to Elucidation and Characterization ofNITI The tumor suppressor gene FHIT encompasses 
the common human chromosomal fragile site at 3pl4.2 and numerous cancer cell bi-allelic deletions. 

To study Fhit function, Fhit genes in D. melanogaster and C. elegans were cloned and characterized. The 
Fhit genes in both of these organisms code for fusion proteins in which the Fhit domain is fused with a 
novel domain showing homology to bacterial and plant nitrilases; the D. melanogaster fusion protein 
exhibited diadenosine triphosphate (ApppA) hydrolase activity expected of an authentic Fhit homolog. 

In human and mouse, the nitrilase homologs and Fhit are encoded by two different genes, FHIT and 
NIT], localized on chromosomes 3 and I in human, and 14 and I in mouse, respectively. Human and 
murine NIT] genes were cloned and characterized, their exon-intron structure, their patterns of 
expression, and their alternative mRNA processing were determined. 

The tissue specificity of expression of murine FHIT and NIT] genes was nearly identical. Typically, 
fusion proteins with dual or triple enzymatic activities have been found to carry out specific steps in a 
given biochemical or biosynthetic pathway; Fhit and Nitl, as fusion proteins with dual or triple 
enzymatic activities, likewise collaborate in a biochemical or cellular pathway in mammalian cells. 

Importance of MIT The human FHIT gene at chromosome 3pl4.2, spanning the constitutive 
chromosomal fragile site FRAM, is often altered in the most common forms of human cancer and is a 
tumor suppressor gene. The human FHIT gene is greater than one megabase in size encoding an mRNA 
of 1.1 kilobases and a protein of 147 amino acids. 

The rearrangements most commonly seen are deletions within the gene. 

These deletions, often occurring independently in both alleles and resulting in inactivation, have been 
reported in tumor-denived cell lines and primary tumors of lung, head and neck, stomach, colon, and 
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other organs. In cell lines derived from several tumor types, DNA rearrangements in the FHIT locus 
correlated with RNA and/or Fhit protein alterations. 

Because the inactivation of the MIT gene by point mutations has not been demonstrated conclusively 
and because several reports have shown the amplification of aberrant-sized MIT reverse trans cription- 
PCR (RT-PCR) products from normal cell RNA, a number of investigators have suggested that the MIT 
gene may not be a tumor suppressor gene. On the other hand it has been reported that re-expression of 
Fhit in lung, stomach and kidney tumor cell lines lacking endogenous protein suppressed tumorigenicity 
:in vivo in.4 out of 4 cancer cell lines. This suggests that MIT is indeed a tumor suppressor gene. It is 
noted that a report has suggested that Fhit enzymatic activity is not required for its tumor suppressor 
function. 

Fhit protein is a member of the histidine triad (HIT) superfamily of nucleotide binding proteins and is 
similar to the Schizosaccharomyces pombe diadenosine tetraphosphate (AP4A) hydrolase. Additionally 
it has been reported that, m vitro, Fhit has diadenosine triphosphate (ApppA) hydrolase enzymatic 
activity. 

Neither the in vivo function of Fhit nor the mechanism of its tumor suppressor activity is known. 
Nonetheless, genetic, biochemical and crystallographic analysis suggest that the enzyme- substrate 
complex is the active form that signals for tumor suppression. One approach to investigate function is to 
investigate Fhit in model organisms such as Drosophila melanogaster and Caenorhabditis elegans. 

The present invention involves the isolation and characterization of the NIT] gene in these organisms. 
Fhit occurs in a fusion protein, Nit-Fhit, in D. 

melanogaster and C. elegans, but FHIT and NIT] are separate genes in mammalian cells. The human and 
mouse NIT] genes are members of an uncharacterized mammalian gene family with homology to 
bacterial and plant nitrilases, enzymes which cleave nitriles and organic amides to the corresponding 
carboxyHc acids plus ammonia. 

SUMMARY OF THE INVENTION Accordingly, it is an object of the present invention to purify a 
NIT] gene. 

It is a further object of the present invention to purify a NIT] gene, wherein the purified gene is a human 
gene. 

It is an object of the present invention to purify a NITl gene, wherein the purl led gene is a manmialian 
gene. 

It is an object of the present invention to purify a Nitl protein. 

It is another object of the present invention to purify a Nitl protein, wherein the purified protein is a 
human protein. 

It is another object of the present invention to purify a Nitl protein, wherein the purified protein is a 
mammalian protein. 

Yet another aspect of the present invention is a purified protein encoded by a nucleic acid having a 
nucleotide sequence consisting of the coding region of SEQ ID NO: I (Figure 6). 

Another aspect of the present invention is an antibody capable of binding a Nitl protein. 

It is another object of the present invention to isolate a nucleic acid of less than 100 kb, comprising a 
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nucleotide sequence encoding a Nit I protein. 

Another object of the present invention is a pharmaceutical composition comprising a therapeutically 
effective amount of a Nitl protein; and a therapeutically acceptable carrier. 

Another object of the present invention is a method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amount of a molecule that 
inhibits Nitl function. 

Another aspect of the present invention is a method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amoxmt of a molecule tHat 
enhances Nitl function. 

It is yet another aspect of the present invention to diagnose or screen for the presence of or a disposition 
for developing a disease in a subject, comprising detecting one or more mutations in NIT] DNA, RNA 
or Nitl protein derived from the subject in which the presence of said one or more mutations indicates 
the presence of the disease or disorder or a predisposition for developing the disease or disorder. 

It is yet another aspect of the present invention to treat a disease or disorder v^ith a vector containing the 
coding segment of the NIT] gene. 

BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 . A sequence comparison of human, murine, D. 
melanogaster, and C. 

elegans Nitl and Fhit proteins. Identities are shown in black boxes, similarities are shown in shaded 
boxes. For human and mouse FHIT GenBank accession numbers are U46922 and AF047699, 
respectively. 

Fig. 2. Northern blot analysis of expression of NIT] and FHIT mRNAs in murine and human tissues, as 
well as in D. melanogaster, and C elegans. (A) Mouse multiple tissues Northern blot. Lanes 1-8: heart, 
brain, spleen, lung, liver, skeletal muscle, kidney, and testis. (Top) Fhit probe; (Middle) Nit] probe; 
(Bottom) actin probe. (B) Human blot, NIT] probe. Lanes 1-8: heart, brain, placenta, lung, liver, skeletal 
muscle, kidney, and pancreas. (C) Lanes 1 and 2: D. 

melanogaster adult, D. melanogaster embryo; D. melanogaster Nit-Fhit probe. 

Lane 3: C. elegans adult; C. elegans Nit-Fhit probe. 

Fig. 3, Genomic organization of human and murine NIT] genes and D. 

melanogaster and C. elegans Nit-Fhit genes. (A) Exon-intron structure of the genes. (B) Alternative 
processing of human NIT] gene. 

Fig. 4. Cleavage of ApppA by D. melanogaster Nit-Fhit. At indicated times of incubation, samples were 
spotted on TLC plates with appropriate nucleotide standards. 

Fig. 5. Analysis of alternative transcripts of human NIT] by RT-7PCR. RT- PGR of HeLa RNA was 
performed with primers in different exons. Lanes 1-6: 

exons I and 3 (transcript 2); exons IC and 3 (transcript 5); exons I A and 3 (transcripts 3, upper band and 
4, lower band): exons 2 and 3 (transcripts 2-4); exons I and I C (transcript 5); and exons I and 2 
(transcript 2). 
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Fig. 6. Highly conserved sequence of human, murme, D. melanogaster, and C elegans NIT] gene. (SEQ 
ID NO: 1). 

DETAILED DESCRIPTION Genoinic and cDNA clones One million plaques of a mouse genomic 
library (bacteriophage library from strain SVJ129, Stratagene, La Jolla, CA) and one hundred thousand 
plaques of a D. melanogaster genomic library were screened with corresponding cDNA probes. Clones 
were purified and DNA was isolated. Sequencing was carried out using Perkin Elmer thermal cyclers 
and ABI 377 automated DNA sequencers. 

DNA pools from a human BAC library (Research Genetics, Huntsville, AL) were screened by PCR with 
NIT] primers (TCTGAAACTGCAGTCTGACCTCA (SEQ ID N0:2) and 

CAGGCACAGCTCCCCTCACTT (SEQ ID N0:3)) according to the supplier's protocol. The DNA 
from the positive clone, 3 1 KI 1 , has been isolated using standard procedures and sequenced. 
Chromosomal localization of the human NIT] gene was detennined using a radiation hybrid mapping 
panel (Research Genetics) according to the supplier's protocol and with the same primers as above. To 
map murine Nit] gene, Southern blot analysis of genomic PHXM. ph DNA from progeny of a (AEJIGn- 
a bpHla b spretus)F I x AEJIGn-a b /a bh P backeross was performed using a full length murine Nit] 
cDNA probe. This probe detected a imique 2.0 kb Dral fragment in AEJ DNA and a vmique 0. 75 kb 
fragment in M. spretus DNA. Segregation of these fragments were followed i D 180 N2 offspring of the 
backeross. Additional Mit markers (DIM04, DIM35, and D]Mit209) were typed from DNA of 92 mice 
by using PCR consisting of an initial denaturation of 4 minutes at 94'C followed by 40 cycles of 94'C for 
30 seconds, 55'C for 30 seconds and 72'C for 30 seconds. Linkage analysis was performed using the 
computer program SPRETUS MADNESS: PART DEUX. 

Human and mouseNITI expressed sequence tag (EST) clones were purchased form Research Genetics. 
The sequences of human and murine NIT] genes and cDNAs and D. melanogaster and C. elegans Nit- 
Fhit cDNAs have been deposited in GenBank. 

In situ hybridization D. melanogaster polytene chromosome spreads were prepared from salivary glands 
of third-instar larvae as described. NitFhit DNA fragments were labeled with digoxigenin- 1 1-dUTP 
using a random-primed DNA labeling kit (Boeringer Mannheim, Indianapolis, IN), and were used as 
probes for the chromosomal in situ hybridization. Hybridization was for 20 hours at 37C in 
hybridization buffer: 50% fon-namide, 2x standard saline citrate (SSQ, 10% dextran sulfate, 400 mg/ml 
salmon spenn DNA. Antidigoxigenin-fluorescein antibodies (Boehringer Mannheim) were used for 
detection of hybridizing regions. 

DNA was counterstained with Hoechst 33258 (Sigma, St. Louis, MO). The slides were analyzed by 
fluorescence microscopy. For in situ hybridization, embryos were fixed and processed as described 
previously, except that single- stranded RNA probes were used. Full length NitFhit cDNA was cloned 
into BluescriptU KS+ vector and used to synthesize antisense RNA probes with the Genius 4 kit 
(Boehringer Mannheim). 

RT-PCR, Northern and RACE analysis Human and mouse multiple tissue northern blots (Clontech, Palo 
Alto, CA) were hybridized with corresponding NIT] cDNA probes and washed using the supplier's 
protocol. For the HeLa cell line, total RNA was isolated from 1-5 x 108 cells using Trizol reagent 
(Gibco BRL, Gaithersburg, MD). D. melanogaster PolyA+ RNA was purchased from Clontech. Three 
~Lg of poly A+ RNA or 15 gg of total RNA were electrophoresed in 0.8% agarose in a borate buffer 
containing formaldehyde, transferred to HybondN+ membrane (Amersham, Arlington Heights, IL) 
using standard procedures and hybridized as described above. For RT-PCR, 200 ng of poly A+ RNA or 3 
~tg of total RNA were treated with DNasel (amplification grade, Gibco BRL) following the 
manufacturer's protocol. DNase- treated RNA was used in reverse transcription (RT) reactions as 
follows: 10 nM each dNTP, 100 pmoles random hexamers (oligo (dT) priming was used in some cases), 
DNasel treated RNA, and 200 units of murine leukemia virus (MuLV) reverse transcriptase (Gibco 
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BRL), in total volume of 20 Itl were incubated at 42'C for I hour followed by the addition of 10 ~tg 
RNase A and incubation at 37C for 30 min. One ~L1 of the reaction was used for each PGR reactioii. 
PGR reactions were carried out under standard conditions using 10 pmoles of each gene-specific primer 
and 25-35 cycles of 95' 30", 55-60' 30", 72' 1'. Products were separated on 1.5% agarose gels and 
sometimes isolated and sequenced or cloned and sequenced. Oligo (dT)-primed double-stranded cDNA 
was synthesized by using procedures and reagents from the Marathon RAGE cDNA amplification kit 
(Glontech); the cDNA was ligated to Marathon adapters (Glontech). 3' and 5' RACE products were 
generated by long PGR using gene-specific primers and the API primer (Glontech). To increase the 
specificity of the procedure, the second PGR reaction was carried out by using nested gene-specific 
primers and the AP2 primer (Glontech). PGR reactions were perforriied according to the Marathon 
protocol using the Expand long template PGR system (Boehringer Mannheim) and cycles of. 94' 30", 60' 
30", 68' 4'. RAGE products were electrophoresed, identified by hybridization and sequenced. Degenerate 
FHIT primers were: 

GTNGTNGGNGGNGAYGTNGT (SEQ ID N0:4) and AGRTGNAGRTGYTTNAGNGTYTGNGC 
TSEOlD NO:5)7D. MelaiiopstefFhit RAGE and^^^^ 

GGGGCTTTGTGGGGTGGAGTG (SEQ ID N0:6) and GGGTGGGGGAAGTTGTGTGGT (SEQ ID 
NO-7) C elegans Fhit RAGE and RT-PGR primers were: GTGGGGGGTGGTGAAACTGG (SEQ ID 
NO- 8) and TGGGGAGGATGAAGAAGTGGG (SEQ ID NO: 9). Human NIT] RT-PGR primers were: 
GGGGTGGGGATGGGAGGGT (SEQ ID NO:10) (exon 1); GAGGTAGTGGGTATGGGGTG (SEQ ID 
NO- 1 1) (exon I a): GGTGGGAAGTGGAGAGGTAAG (SEQ ID NO: 12) and 
AAAGTGAAGGGTGTTTGGTGTGAG (SEQ ID N0:13) (exon Ic); TGGGGTTGATGAGGAGGGGT 
(SEQ ID NO: 14) and GTGGGGTGAGGAGAAAGTAGTG (SEQ ID l^JO: 15) (exon 2); 
GGTTGTGTGGGGTGGATGTTA (SEQ ID NO: 1 6) (exon 3). 

Protein expression and enzymatic characterization The NIT-FHIT cDNA was amplified with primers 
TGAGGTGGAGATATGTGAAGTGTAGTTAATAGGAGG (SEQ ID NO: 17) and 
TGGGTAGGTGGAGTAGGTTATGTGG (SEQ ID NO: 1 8), digested with Ndel and Kpnl, and cloned 
into plasmid pSGA02 as a Ndel-Kpnl fi-agment. 

Escherichia coli strain SGIOO transformants were grown in Luria-Bertani with 100 ~tg/ml of ampicillin 
and 15 ~tg/ml of chloramphenicol at 15'G. When the culture reached an optical density (600 nm) of 
0.25, isopropyl B-D- thiogalactoside was added to a final concentration of 200 ptM. NitFhit protein was 
purified from inclusion bodies as described. Briefly, the cell pellet from a I-liter culture was resuspended 
in 5 0 ml of 20 mM Tris-HGI (pH 7.5), 20% sucrose, I MM EDTA and repelleted. Outer cell walls were 
lysed by resuspension in ice-water. Spheroblasts were pelleted, resuspended in 140 mM NaGl, 2.7 mM 
KGI, 12 mM Na-P04 (pH 7.3), 5mM EDTA, 500mM phenylmethylsulfonyl fluoride, 1 [tg/ml leupeptin 
and ~tg/ml of aprotinin, and sonicated. The resulting inclusion body preparation was washed and 
solubilized in 5 M guanidinium hydrochloride, 50mM Tris- HGI (pH 8.0), 5mM EDTA. Soluble NitFhit 
protein was added dropwise to 250ml of 50mM Tris-HGI (pH 8.0), ImM DTT, 20% glycerol at 40'G. 
After a 14 hour incubation, the 13 -kg supernatant was concentrated 100-fold with a Gentricon filter. A I 
-liter culture yielded approximately 200 ~Lg of partially purified, soluble NitFhit. Appp A hydrolase 
activity was assayed at 30'G in 20 ~il of 50nim Na-HEPES pH 7.5, 10% glycerol, 0.5 mM MnG12, 
4mM ApppA, I [M NitFhit. 

TLG plates were developed as described. 

Gloning and characterization of D. melanogaster and G el—.gans Fhit homolQgs To obtain D. 
nielanogaster Fhit sequences, degenerate primers were designed in the conserved regions of exons 5 and 
7 of human FHIT. RT-PGR experiments with these primers and D. melanogaster RNA resulted in an - 
200 bp product, which when translated showed -50% identity to human Fhit protein. This sequence was 
used to design specific D. melanogaster Fhit primers. 5' and 3' RAGE with these primers resulted in -1.5 
kb full length cDNA (including polyadenylation signal and Poly(A) tail) encoding a 460 amino acid 
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protein with a 145 amino acid C-terminal part homologous to human Hit (40% identity and 47% 
similarit)') and a 3 15 amino acid N-terminal extension (Fig. 1). Northern analysis (Fig. 2C) showed a 
singer band of -1 .5 kb in both embryo and adult D. 

melanogaster confirming that the full length cDN A has been cloned. 

The 460 amino acid predicted protein sequence was used in a BLASTP search. Of the top 50 scoring 
alignments, 22 aligned with the 145 residue C- terminal segment (1 7hit-related sequences) and 28 
—aligned with the 3 15 residue N- terminal segment. The 28 sequences aligning with the N-terminus were 
led by an uncharacterized gene from chromosome X of Saccharomyces cerevisiae (P- value of 1 .4 x 10- 
41), followed by uncharacterized ORFs of many bacterial genomes and a series of enzymes from plants 
and bacteria that have been characterized as nitrilases and amidases. Thus, the 460 amino acid predicted 
protein contains an N- terminal nitrilase domain and a C-terminal Fhit domain and was designated 
NitFhit. 

The D. melanogaster Nit-Fhit cDNA probe was used to screen a D. 

melanogaster lambda genomic library. Sequencing of positive clones revealed that the gene is intronless 
and, interestingly, the 1.5-kb Nit-Fhit gene is localized within the 1.6-kb intron I of the D. melanogaster 
hornolog of the murine glycerol kinase (Gyk) gene. The direction of transcription of the Nit-Fhit gene is 
opposite to that of the Gyk gene (Fig. 3 A). It is not known if such localization affects transcriptional 
regulation of these two genes. 

The cytological position of the Nit-Fhit gene was determined by in situ hybridization to salivary gland 
polytene chromosomes. These experiments showed that there is only one copy of the sequence which 
was localized to region 61 A, at the tip of the left arm of chromosome 3. Digoxigenin-labeled RNA 
probes were hybridized to whole-mount embryos to determine the pattern of expression during 
development. Nit-Fhit RNA was uniformly expressed throughout the embryo suggesting that NitFhit 
protein could be important for most of the embryonic cells. 

Because human Fhit protein and the D. melanogaster Fhit domain were only 40% identical, to show that 
the authentic D. melanogaster Fhit homolog was cloned, its enzymatic activity was tested. Fig. 4 shows 
that recombinant D. 

melanogaster NitFhit is capable of cleaving ApppA to AMP and ADP and therefore possesses ApppA 
hydrolase activity. 

C. e I gc„ a n Fhit genomic sequences were obtained from the Sanger database (contig Y56A3) by using 
BLAST searches. 5' and 3' RACE with C. elegans Fhit specific primers yielded a 1 .4-kb cDNA 
(including polyadenylation signal and Poly(A) tail) coding for a 440 amino acid protein (Fig. 1). 
Northern analysis (Fig. 2Q showed a single band of a similar size in adult worms. Similarly to D. 

melanogaster, the C. elegans protein contained an N-terminal nitrilase domain and a C-terminal Fhit 
domain (Fig. 1) with 50% identity and 57% similarity to human Fhit. Comparison between C. elegans 
Nit-Fhit cDNA and genomic sequences from the Sanger database revealed that the C. elegans Nit-Fhit 
gene comprises 8 exons and is more than 6.5 kb in size (Fig. 3A); the nitrilase domain is encoded by 
exons 1 -6, and the Fhit domain is encoded by exons 6-8. D. melanogaster and C. 

elegans NitFhit proteins are 50% identical and 59% similar and exhibit several conserved domains (Fig. 
1). 

Cloning and characterized of human and murine NIT cDNAs and genes Because Fhit and nitrilase 
domains are part of the same polypeptides in D. 
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nielanogaster and C. elegans, it is reasonable to suggest that they may be involved in the same 
biochemical or cellular pathway(s) in these organisms. Because nitrilase homologs are conserved m 
animals, the mammalian nitrilase homologs were cloned as candidate Fhit-mteractmg protems. 

To obtain human and murine NIT] sequences, the D. melanogaster nitrilase domain sequence was used 
in BLAST searches of the GenBank EST database. 

Numerous partially sequenced human and murine NIT] ESTs were found. All mouse Nit] ESTs were 
identical, as were all human NIT] ESTs, suggesting the presence of a single NIT] gene m mouse and 
human. To obtain the full- length human and mouse cDNAs, several human and mouse ESTs and human 
5' and 3' RACE products were completely sequenced. This resulted in the isolation of a 

-1 4-kb full-length human sequence encoding 327 amino acids and a -1.4- kb mouse full-length 
sequence coding for 323 amino acids (Fig. 1), although several alternatively spliced products were 
detected in both cases (see below and Fig. 3B). 

Both cDNAs are polyadenylated, but lack polyadenylation signals, although AT- rich regions are present 
at the very 3' end of each cDNA. Mouse and human Nitl amino acid sequences were 90% identical; the 
human Nitl amino acid sequence was 58% similar and 50% identical to the C elegans mtnlase domam 
and 63% similar and 53% identical to the D. melanogaster nitrilase domain (Fig. 1). 

Murine lambda and human BAG genomic libraries were screened with the corresponding NIT] cDNA 
probes, yielding one mouse lambda clone and one human BAG clone containing the NIT] genes. The 
human' and murine NIT] genomic regions were sequenced and compared to the corresponding cDNA 
sequences. The genomic structure of human and mouse NIT] genes is shown in Fig. 3 A. Both genes are 
small- the human gene is -3.2 kb in size and contains 7 exons; the murine gene is -3.6 kb in size and 
contains 8 exons. Southern analysis confirmed that both human and mouse genomes harbor a smgle 
NIT] gene. 

A radiation hybrid mapping panel (GeneBridge 4) was used to determine the chromosomal localization 
of the human NIT] gene. By analysis of PGR data at the WhiteheadMIT database (http:/7www- 
cenome.wi.mit.edLi), the NIT] gene was localized 6.94 cR from the marker GHLG.GATA43 A04, 
which is located at 1 q2 1 - 1 q22. 

A full length murine Nit] cDNA probe was used to determine the chromosomal location of the murine 
gene by linkage analysis. Interspecific backcross analysis of 180 N2 mice demonstrated that the Nit] 
locus cosegregated with several previously mapped loci on distal mouse chromosome 1 . The region to 
which Nit] maps was further defined by PGR of genomic DNA from 92 N2 Mice usmg the markers 
DIM04, DIM05 and DIMit209 (Research Genetics). The following order of the genes typed in the cross 
and the'ratio of recombinants to N2 mice was obtained: centromere - DIM04 - 7/78 - DIM05 - 8/90 - 
Nitl - 11/91- D-lMit209 - telomere. The genetic distances given in centiMorgans (nS.E. ) are as follows: 
centromere - DlMit209 - 9.0 n 3.2 - DJMit35 - 8.9 n 3.0 - Nit] - 12.1 n3.4 - DIMit209 - telomere. This 
region of mouse chromosome I (lq21 - lq23) is syntenic to human chromosome Iq and is consistent with 
the localization of the human ortholog of Nit] . 

Expression and alternative splicing of hunian and murine Nit] genes For the human gene. Northern 
analysis revealed two major transcripts of 

-1 .4 kb and -2.4 kb in all adult tissues and tumor cell lines tested. A third band of 

-1 2 kb was observed in adult muscle and heart (Fig. 2B). The longest cDNA (-1 .4 kb) corresponds to 
the -1 4-kb transcript observed on Northern blots. The 1 .2-kb band corresponds to transcnpt I on Fig. 3B 
(see below). It is not known if the -2.4-kb RNA represents an additional transcript or an incompletely 
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nrocessed mRNA. No significant variation in human NIT] mRNA levels was observed in different 
tissues (Fig. 2B). On the contrary, different mouse tissues showed different levels of expression of Nit] 
mRNA (Fig. 2A). The highest levels of Nit] mRNA were observed in mouse liver and kidney (Hg. 2A, 
Middle, lanes 5 and 7). 

Interestingly the pattern of Nit] expression was almost identical to the pattern of the expression of Fhit 
(Fig. 2 A, Top and Middle), supporting the hypothesis that the proteins may act in concert or participate 
in the same pathway. 

Analysis of mouse Nit] ESTs revealed that some transcripts lack exon 2 and encode a 323 amino acid 
protein. An alternative transcript containing exon 2 encodes a shorter, 290 amino acid protein starting 
with the methionine 34 (Fig. 1). 

Analysis of human ESTs and 5' RACE products from HeLa and testis also suggested alternative 
nrocessine To investigate this, a series of RT- PGR experiments was carried out. Fig. 5 shows the 
results obtained from HeLa RNA (similar results were obtained using RNAs from the MDA-MB-436 
breast cancer cell line and adult liver). The alternatively spliced transcnpts are shown on Fig. 

3B Transcript 1, lacking exon 2, was represented by several ESTs in the Genbank EST database. This 
transcript probably corresponds to the .1.2-kb transcript observed on Northern blots in adult muscle and 
heart Transcript 2 encoding the 327 amino acid Nitl protein (Fig. 1) is a major tt-anscript of human NIT] 
at least in the cell lines tested. This transcript lacks exons la and lb. Transcnpt 3 has exon I a and l b; 
transcript 4 has exon I a but lacks exon 1 b (Fig. 3B). It is not known if tt-anscript 5 (lackmg exon 2) 
starts from exon I or 1 c. 

The alternative initiating methionines of different transcripts are shown on Fig. 3B. Data suggest that at 
least in COS-7 cells transfected with a construct containing transcript 2, the methionine in exon 3 
(shown in transcripts I and 3, Fig. 3B) initiates more efficiently than the methiomne m exon 2 (Fig. 3B, 
transcript 2). 

Discussion Although the frequent loss of Fhit expression in several common human cancers is well 
documented, and results supporting its tumor suppressor activity have been reported, the role of Fhit m 
normal and ttimor cell biology and its mechanism of its action in vivo are unknoxyn. The AP3A 
hvdrolytic activity of Fhit seems not to be required for its tumor suppressor fimction, and it has been 
suggested that the enzyme-subtract complex is the active form of Fhit. To facilitate an investigation of 
Fhit function, a model organisms approach was initiated by cloning and characterization of D. 
melanogaster and G. elegans Fhit genes. 

Surprisingly in flies and worms, Fhit is expressed as a fiision protein with the Fhit domain fiised into a 
"Nit-- domain showing homology to plant and bacterial nitrilases. Human and murine NIT] genes were 
further isolated. Nit and Fhit are expressed as separate proteins in mammals but, at the mRNA level, are 
coordinately expressed in mouse tissues. 

In several eukaryotic blosynthetic pathways multiple steps are catalyzed by multifimctional proteins 
containing two or more enzymatic domains. The same steps in prokaryotes frequently are carried out by 
monoenzymatic proteins that are homologs of each domain of the corresponding eukaryotic protem For 
example, Gars, Gart and Airs are domains of the same protein in D. melanogaster and mammals. These 
domains'catalyze different steps in de novo synthesis of purines. 

In yeast Gart homolog (Ade8).is a separate protein and Gars and Airs homologs (Ade5 and Ade7) are 
domains of a bienzymatic protein; in bacteria, all three homologs (PurM, PurN and Purl)) are separate 
proteins De novo pyrimidine biosynthesis illustrates a similar case. Recently, a fusion protein of a 
lipoxygenase and catalase, both participating in the metabolism of fatty acids, has been identified in 
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corals. In all of these examples, if domains of a multienzymatic protein in some organisms are expressed 
as individual proteins in other organisms, the individual proteins participate in the same pathways. This 
observation and the fact that Fhit and Nitl exhibit almost identical expression patterns in murine tissues 
suggest that Fhit and Nitl participate in the same cellular pathway in mammalian cells. 

Claims 



WHAT IS CLAIMED IS: 

1. A purified NIT] gene. 

2. The gene of claim I which is a human gene. 

3. The gene of claim I which is a mammalian gene. 



4. A purified Nitl protein. 

5. The protein of claim 4 which is a human protein. 

6. A purified protein encoded by a nucleic acid having a nucleotide sequence consisting of the coding 
region of SEQ ID NO: 1 . 

7. An antibody which is capable of binding a Nitl protein. 

8. The antibody of claim 7 which is monoclonal. 

9. A molecule comprising a fragment of the antibody of claim 7, which fragment is capable of binding a 
Nit I protein. 

10. An isolated nucleic acid of less than 100 kb, comprising a nucleotide sequence encoding a Nitl 
protein. 

1 1 . The nucleic acid of claim 1 0 in which the Nit I protein is a human Nit I protein. 

12. A pharmaceutical composition comprising a therapeutically effective amount of a Nitl protem; and a 
therapeutically acceptable carrier. 

13. A method of treating or preventing a disease or disorder in a subject comprising administering to 
said subject a therapeutically effective amount of a molecule that inhibits Nitl function. 

14. A method of treating or preventing a disease or disorder in a subject comprising administering to 
said subject a therapeutically effective amount of a molecule that enhances Nitl function. 

15. A method of diagnosing or screening for the presence of or a predisposition for developing a disease 
or disorder in a subject comprising detecting one or more mutations in NIT] DNA, RNA or Nit 1 protein 
derived from the subject in which the presence of said one or more mutations indicates the presence of 
the disease or disorder or a predisposition for developing the disease or disorder. 

16. A method of treating or preventing a disease or disorder in a subject by using a vector containing the 
NIT] gene coding sequence. 
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